Picture for Yuan Liu

Yuan Liu

The University of Hong Kong

UniAudio-Token: Empowering Semantic Speech Tokenizers with General Audio Perception

Add code
May 29, 2026
Viaarxiv icon

DiffSpot: Can VLMs Spot Fine-Grained Visual Differences in Web Interfaces?

Add code
May 28, 2026
Viaarxiv icon

VersusQ: Pairwise Margin Reasoning for Generalizable Video Quality Assessment

Add code
May 20, 2026
Viaarxiv icon

OpenCompass: A Universal Evaluation Platform for Large Language Models

Add code
May 19, 2026
Viaarxiv icon

Real2Sim in HOI: Toward Physically Plausible HOI Reconstruction from Monocular Videos

Add code
May 14, 2026
Viaarxiv icon

AutoVQA-G: Self-Improving Agentic Framework for Automated Visual Question Answering and Grounding Annotation

Add code
Apr 19, 2026
Viaarxiv icon

POINTS-Seeker: Towards Training a Multimodal Agentic Search Model from Scratch

Add code
Apr 15, 2026
Viaarxiv icon

Beyond Transcription: Unified Audio Schema for Perception-Aware AudioLLMs

Add code
Apr 14, 2026
Viaarxiv icon

POINTS-Long: Adaptive Dual-Mode Visual Reasoning in MLLMs

Add code
Apr 13, 2026
Viaarxiv icon

UniRecGen: Unifying Multi-View 3D Reconstruction and Generation

Add code
Apr 01, 2026
Viaarxiv icon